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METHOD TO IDENTIFY GENE FUNCTION 
USING SMALL MOLECULE PROBES 

Technical Field 

The invention relates to gene discovery, in particular, identification of gene 
function. The invention concerns the use of phage display techniques to monitor 
ligand/receptor interactions where the ligand is a typical small molecule 
pharmaceutical. 

Background Art 

Numerous methods to detect the interaction of ligands with their receptors 
have been described. Most drug discovery endeavors appear to focus on hi^ 
througihput screening of libraries of compounds which are potential agonists or 
antagonists against a cloned receptor known to be a phannaceutically important 
target. The converse endeavor, i.e. determination of Ae natural receptor for a 
pharmaceutical or other small molecule such as a natural product, has been less 
reported. Such an approach is, however, of great significance for example, because 
identification of ligand/receptor binding for a candidate pharmaceutical would 
elucidate not only the mechanism of action of the pharmaceutical, but would also 
identify pathways by which unwanted adverse reactions might occur, and additional 
mechanisms of action. 

Previous methods of investigating small molecule - proteui interactions have 
involved the use of whole cells, which have inherent limitations. These methods 
include the use of stochastic nanodroplets as described by Borchardt, A., et at, Chem. 
Biol (1997) 4:961-968; and by Huang, J., et al, Proc. Natl Acad Set USA (1997) 
94:13396-13401; the use of a three hybrid system as described by Liberies, S.D., 
et al, Proc. Natl Acad. Sci USA (1997) 94:7825-7830 and by Licitra, EJ., etaL 
Proa Natl Acad Sci, USA (1996) 93:12817-12821; and the use of miniaturized 
formats by You, A. J., etal, Chem. Biol. (1997) 4:969-975 and Stockwell, B.R., 
et al, Chem. Biol (1999) 61-83. 
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Phage display is a widely used method for identifying proteins and the genes 
encoding them which bind to other proteins. The original system as disclosed, for 
example, in U.S. patents 5,096,815 and 5,198,346 utilized the filamentous phage M13 
and required translocation of the cloned protein across the £. coli inner membrane 
before incorporation into the phage particle. More recently, lytic bacteriophage 
vectors such as X, T4, and T7 have been used, since they are independent of £ coli 
secretion. T7 is commercially available from Novagen and a detailed description of 
the use of T7 phage is provided in the company's T7 Select® System Manual and is 
described in U.S. patents 5,223,409; 5,403,484; 5,571,698; and 5,766,905, all 
incorporated herein by reference. 

Although phage display systems have been successfully used to identify 
protein-protein interactions, it has not been reported to employ this system to identify 
protems which bind to small molecule ligands. It is unclear why it is difiBcuh for 
small molecule pharmaceuticals to select displayed proteins from a library, but this 
appears to be the case. Only recently, has it even been possible for small peptides to 
achieve tiiis. For example, Zozulya, S., et aL, Nature Biotechnol (1999) 
17: 1 193-1 198 describe the use of the T7 phage display system to map protein 
interactions Imking epidermal growth factor receptor to the Ras protein. Various 
phosphorylated protein fragments were used as bait and were immobilized on 
streptavidin-coated paramagnetic beads. 

Applicants are aware of only two papers describing the use of small peptides 
as bait; the paper by Zozulya, et aL, set forth above and Cochrane, D., et ai, J. Mol 
Biol (2000)297:889-97. 

The present invention extends the use of phage display to the interaction of 
cDNA libraries using traditional pharmaceutical-type small molecules as bait. This 
extension permits tiie direct identification of gene functionality by identifying phage 
which comjprise nucleotide sequences encoding proteins that bind non-peptide small 
molecule ligands which are of known physiological function. 

In addition to small molecule ligands whose physiological function is aheady 
known, compounds of unknown function can be used to ascertain their protein targets. 
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Such molecules may be natural products, for example, or may be members of a 
combinatorial library. 

Disclosure of the Invention 

The invention method provides a direct ^proach to identify proteins and 
genes encoding them, which proteins interact with a specific small molecule ligand. 
By use of the method of the invention, in which a multiplicity of proteins is displayed 
on bacteriophage, the phage containing nucleotide sequences that encode proteins 
capable of mteracting with a specific small molecule bait can be "pulled out" of, for 
example, a cDNA library and identified. The successfiil phage can be amplified and 
sequenced, thus identifying the gene encoding the interacting protein and providing 
the ability to deduce the amino acid sequence of the protein. 

Thus, in one aspect, the invention is directed to a method to identify a 
nucleotide sequence encoding a protein that binds a non-peptide target which method 
comprises amplifying and sequencing the nucleotide sequence contained in a 
bacteriophage, v^dierein the phage displays a protein which binds to die non-peptide 
bait. The phage is identified by a method which comprises contacting a phage display 
library comprising a multiplicity of DNA fragments encoding candidate proteins for 
said binding with said non-peptide bait, the non-peptide bait is coupled to a solid 
support imder conditions wherein said non-peptide bait interacts with a displayed 
protein to which it binds specifically; and, if necessary, washing said solid support to 
remove non-spccifically bound phage. 

The specifically bound phage is then removed by elution and amplified and 
the DNA insert encoding the displayed protein sequenced. 

In still another aspect, the invention is directed to a method to identify a 
compound which binds a protein identified by the above-recited method of the 
invention which method comprises contacting candidate compounds with the protein 
thus identified and assessing the ability of tiie compounds to interact with said protein. 
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In still another aspect, the invention is directed to compounds so identified and 
to methods of treating conditions characterized by the need to modulate the activity of 
the protein by administering the identified compound. 

Brief Description of the Drawings 

Figure 1 shows a diagram of the method of the invention with one round of 
amphfication. 

Figure 2 shows, schematically, the synthesis of biotinylated FK506. 

Figure 3 is a schematic representation of the T7 display vector and a diagram 
of the modified T7 gene. 

Figures 4A and 4B show a model combinatorial compound and a schematic 
for the synthesis of a benzyl cycloadduct as a scaffold supported bait. 

Figxire 5 shows the cotipling of the benzyl cycloadduct to an affinity support. 

Modes of Carrying Out the Invention 

The invention generally employs a phage display system to provide an 
exposed protein for assay of the interaction with a small molecule. The small 
molecule is coupled to a solid support, preferably, but not necessarily, through a 
coupling system that is subject to elution using reagents which are independent of the 
nature of the small molecule. The successfully interacting phage are thus selectively 
bound to the solid support and after washing the support if necessary to remove 
un-bound phage, can be eluted and recovered for amplification and sequencing. In 
general, phage display systems involve incorporating genes encoding non-viral 
proteins into the viral genome so as to result in the production of fusion proteins 
between the desired foreign proteins and a coat protein of the virus. Thus, when the 
virus replicates, the foreign protein is displayed at its surface. Viral replication occurs 
in host cells. In a typical phage display library, a multiplicity of DNA-encoding . 
foreign proteins is inserted and thus viral particles displaying a multiplicity of proteins 
is produced. This library of displayed protems is then contacted with "bait" typically 
bound to a solid support. The successfully interacting displayed proteins will then 
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preferentially be retained on the solid support and the non-binding proteins washed 
away. The bound protein-displaying phage are then eluted and amplified by culturing 
them in cells to produce a library enriched in proteins which bind the bait. Repeated 
rounds of amplification.and selection may be required to obtain a relatively 
homogeneous population of phage which display only the protein or proteins which 
bind with high affinity to the bait The nucleotide sequences contained in the genome 
of these phage can then be sequenced and the amino acid sequence of the binding 
protein deduced. A diagram showing, in general, the process described above is 
shown in Figure L 

In previous work, successfiil isolation of phage which bind a defined bait has 
been possible only when the bait is itself a protein. This has limited the apphcability 
of the technique; it has, for exaniple, been used fi^equendy to provide antibodies 
which bmd to target antigens. However, the present applicants have succeeded in 
applying this technique to the identification of genes encoding proteins which bind 
small molecules. This has been demonstrated as described below by the isolation of 
the gene encoding FK506 binding protem (FKBP12) from a human brain cDNA 
library using FK506 as the bait. In addition, this technique has proved successful with 
respect to members of a combinatorial library. Thus, an arbitrary member of a 
combinatorial library is coupled to the solid support and is able to "pull out" a binding 
target protein. The protein thus obtained may have a known function, or may 
constitute a new potential target for molecules that are similar to the molecule which 
exhibits specific binding to the retrieved protein. 

A particularly useful combinatorial library is disclosed herein. The hbrary is 
based on a scaffold shown in formula 1 of Figure 4A where the defined portions of 
the molecule comprise the scaffold and R1-R5 represent variable substituents which 
can be manipulated at will. The scaffold contaimng the desired substituents can 
conveniently be coupled to a solid support through a linker such as that shown in 
Figure 5. In this case, the coupling of the bait to the solid support is covalent and the 
elution of the specifically bound phage comprises the use of a solution containing a 
compound which mimics flie bait and thus displaces the coupled specifically bound 
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phage or a change in solvent conditions. A successful experiment resulting in 
retrieval of a compound which binds a benzyl cycloadduct of fonmula 1 is set forth 
below in Example 2. 

In general, the method of the invention relies on a phage display system 
wherein the inserted nucleotide sequences are typically members of a cDNA library. 
cDNA libraries can be assembled from a variety of sources, including single tissues 
such as brain, liver, stomach, prostate tissue, breast tissue, and the like from a variety 
of mammals or can be derived from any source, including yeast, invertebrates, plants, 
or prokaryotes. Libraries of the greatest interest are derived from mammals, 
particidarly humans. The tissue may be derived from an organism which is 
considered "normal," or the organism or tissue may exhibit an abnormal condition, 
such as inflanamation, tumor g*owth, hypertrophy, and the like. Techniques for 
preparing cDNA libraries are well known and such libraries can be obtained from 
very small samples, including single cells. Cell cultures may also be used. Various 
cDNA libraries are also available commercially. 

The phage system can be selected from a variety of well known display 
systems, including the classical M13 system as well as X phage, T4 and T7. 
Particularly preferred are the class of lytic phage, and m particular those in which the 
inserted nucleotide sequence tails the nucleotide sequence encoding a coat protein. 
The advantage of such tailed inserts is that the presence of a stop codon in the inserted 
nucleotide sequence does not prevent display of the encoded amino acid sequence as a 
contiguous fusion. The advantage of lytic phage resides in the ease with which the 
displayed protein can contact a selection means. One disadvantage of the M13 
systems is that the phage containing the display protein must be secreted from £ coli 
in order to contact the selection system, thus conferring an advantage on normally 
secreted proteins, for example. However, lytic phage, because the host cell is lysed, 
can be immediately contacted with, for example, an aflSnity support. 

In order for the method to be successfril, the target protem must be displayed 
on the phage at a sufficient level that it can successfully be retrieved by the small 
molecule bait. Clearly, the smaller the number of different proteins displayed in the 
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phage suspension (assuming that the protein which binds the bait is indeed included in 
the mixture) the higher the probability that the bait will capture the appropriate phage. 
Preferably, therefore, the phage display library will contain no more than 10^ different 
displayed proteins, preferably no more than 10* dilBferent displayed proteins, more 
preferably no more than 10^ different displayed proteins, and most preferably no more 
than 10* different displayed proteins. As described below, the number of different 
proteins displayed in the library is successively decreased with successive rounds of 
amplification; if the diversity of Ae library is reduced (without deleting the desired 
protem) the number of rounds of selection can be reduced as well. 

According to the method of the invention, the selecting "bait" is a small 
molecule m the classical sense of that term. Thus, the molecule includes, for 
example, pharmaceuticals which are natural products or which are prqpaied 
synthetically, but which are not peptides or proteins. Thus, the non-peptide bait 
includes polyketides such as FK506 and erythromycin, synthetic drugs such as 
cefaclor, steroids such as finasteride, and the multiplicity of nori-peptide molecules 
found in the U.S. Pharmacopoeia, the contents of w*dch are incorporated herein by 
reference, to supply a non-limiting list of small molecules included within the scope 
of the invention. 

As set fortti above, the small molecule bait need not be a pharmaceutical or 
natural product of known function, but can be a new compound or an arbitrarily 
chosen compound whose function in biological systems, if any, is not known. A 
particidarly useful source of such molecules is a combinatorial Ubrary wherein a 
multiplicity of compounds on a similar scaffold is constructed. 

The small molecule bait is used to select a phage bearing a nucleotide 
sequence which encodes a protein that interacts with the bait by contacting a 
suq)ension of phage particles with a solid support to which the bait is coupled. The 
solid siqjport can be any of a multiplicity of sudi supports, including agarose, 
polystyrene or other polyvinyl compounds, magnetic beads, and the like. The bait 
may be coupled covalently to the support or may be noncovalently bound by a system 
which permits the release of the entire complex between the bound phage and bait 



wo 01/18234 



PCT/USOO/24124 



For example, a column derivatized with N-hydroxysuccinimide can be used to couple 
covalently a carboxylic acid function on the bait molecule. For such coupling, elution 
of the bound phage would rely on competition with excess bait or other change of 
conditions. However, advantageously mild elution conditions may be used when the 
bait is coupled covalently only to a linker ligand which is itself noncovalently coupled 
to the support. Under those circumstances, the complex can be eluted in a manner not 
specific to the bait - for example, by supplying an excess of the linker. Illustrated 
below is a system wherein the bait is covalently bound to biotin which serves as a 
linker to noncovalently adsorb to an avidin derivatized column. After washing with 
buffer to remove non-boimd phage, the phage/bait complex can be removed by 
treating the colunm with excess biotin. Similarly, the bait may be covalently coupled 
to other linkers such as polyhistidine which noncovalently associates witii nickel 
chelates permitting removal or elution using excess polyhistidine, or can be coupled 
to glutathione which associates with a glutathione-S-transferase system coupled to the 
support or vice versa. A large number of such linkers which can be covalendy bound 
to bait but noncovalently bound to a derivatized solid support is known, thus 
permitting a variety of non bait-specific elution protocols. 

While this approach permits particularly mild elution conditions and may be 
advantageous, it is not a necessary feature of the invention. Covalent bonding of the 
bait to the solid support is also practical and elution can be effected by methods 
appropriate to this system. 

A first round selection occurs when the phage display library contdning the 
cDNA nucleotide sequences is contacted with the aflSnity support coupled to the non- 
peptide, small molecule bait, washed if necessary with, for example, buffer to remove 
non-specifically boxmd phage, and then eluted according to a procedure based on the 
mechanism used for coupling the bait to the afiBnity support. The eluted and thus 
selected phage members of the library can then conveniently be assessed for 
homogeneity by, for example, size separation of the cDNA inserts. If desired, the 
selected phage from the first round can be amplified and sequenced using standard 
techniques. Additional rounds of selection involve the same steps of contact with 
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affinity support labeled with bait, wash, elution, and analysis, including, if desired, 
amplification and sequencing. 

The number of rounds of selection desirable vail depend on the purpose of the 
experiment. As shown below, only two rounds of selection were required to obtain an 
apparently homogeneous population of a binding protein with a high affinity for the 
FK506 bait. More weakly bound proteins may be obtained by analyzing the results of 
the first roimd of selection. On tfie other hand, if only weakly binding proteins exist 
in the library, several rounds of selection may be necessaiy to retrieve them with 
sufficient probability . 

Retrieval of a phage which binds specifically to a smaU molecxile ligand is 
useful in many contexts. First, it provides a direct method of revealing the cellular 
targets of small molecules, including natural products. This permits classification of 
proteins into groups as well as the identification of interactions that may have 
previously been imknown with respect to a pharmaceutical, thxis revealing the nature 
of any side effects exhibited by administering the drug. Identification of the 
molecular targets involved in these side effects permits the design of formulations and 
co-administered drugs to alleviate such effects. Identification of additional targets for 
a pharmaceutical or other compound, such as a member of a combinatorial library, 
may reveal desirable interactions of these molecules with target proteins as well. 

Once the protein that is the target of a small molecide non-peptide compound 
is identified, alternative small molecules which bind to the same target and have 
similar effects, but perhaps fewer side effects, for example, can readily be found. 
Thus, the identified protein may be used in screening assays for identification of 
compounds which also interact with it Combinatorial libraries can readily be 
screened against such proteins, as well as can individual compounds. This permits 
identification of successful potential therapeutics which can also be assessed using the 
same methods for their side effects or lack thereof. For example, idaitification of the 
receptor which interacts wdth a pharmaceutical known to reduce inflammation would 
permit screening for other anti-inflanmiatory drugs. By also screening with respect to 
proteins which are responsible for side effects of the anti-inflammatory activity and 
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comparing the ability of the test compounds to interact with these as compared to the 
known drug, compounds which inhibit inflammation but which exhibit fewer side 
effects can be identified. These compounds can then be used to treat inflammation in 
lieu of the previously used compound. Thus, the invention is directed to compounds 
identified by these screening assays and to methods to treat conditions mediated by 
the target proteins identified by the method of the invention. 

The following examples are intended to illustrate but not to limit the present 
invention. 

Preparation A 
Synthesis of FK506~biotin f4^ 

This synthesis is shown diagrammatically in Figure 2. 

To a solution of Boc.FK506-TBS 2 (7.5 mg, 5,8 mmol) [Belshaw, P.J., et al, 
T?roc, Natl Acad Set USA (1996) 93:4604-4607] in 250 ml 7:1 CH 3 CN:H 2O was 
added 50 ml 48% HF in an Eppendorf tube. The solution was allowed to stir at room 
temperature until all of the starting material was consumed and ninhydrin staining 
revealed the presence of an amine with low R f (0. 1 in 3 1 . 1 ml t-butylacetate, 1 5 J ml 
glacial acetic acid, 6.0 ml n-butanol and 14.4 ml water). Upon addition of NHS-LC- 
biotin (3.3 mg, 5.8 mmol; Pierce Chemical Company, Rockford, E.) the ninhydrin 
spot immediately disappeared and a new spot with a sKghtly higher R f appeared. The 
reaction was quenched with saturated sodium bicarbonate (250 ml), extracted with 
methylene chloride (3 x 500 ml) and dried over sodium sulfete. The crude mixtui^ 
was subjected to column chromatography (firom 9: 1 to 4:1, CH 2 CI 2 :MeOH) and 
provided 1.7 mg (5.8 mmol) ofFK50WDiotin (4) (Rf 0.1 in 9:1 CH 2 CI 2 :MeOH) in 
35% yield. 
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Preparation B 
Construction of human brain cDNA phage library 

Bovine brain has traditionally been a good tissue source for the FKBP protein 
[Harding, M.W., etai, Nature (1989) 341:758-760]. As display cloning only requires 
a small amount of mRNA for library construction, hxmian brain was chosen as the 
mRNA source. Purified human briain poly A"^ mRNA (obtained jfrom normal whole 
cerebral brain tissue of a 50 year old Caucasian male, Clontech, Inc., Palo Alto, CA) 
was randomly primed in the first-strand synthesis followed by second-strand synthesis 
to yield double-stranded complimentary DNA. The cDNA was doubly digested with 
EcoRI and Hindlll endonucleases and size firactionated. The purified cDNA was then 
directionally Ugated into T7 Selectl-lb vector arms and subsequently in vitro 
packaged into phagemids (No\^igen, Madison, WI). A diagram of the manner in 
which the DNA segments are inserted is shown in Figure 3. As shown, in the T7 
System, the insert is downstream of the code protein (CP 10) so the presence of a stop 
codon in the cDNA transcript does not destroy the ability of the phage to display the 
protein. Ligation efficiencies were evaluated with a small aliquot of the packaged 
phage and indicated a total of 33 x 1 0^ plaque formuig units. These initial 
transformants were subjected to one amplification step by infecting 50 ml log phase 
BLT5403 cells, providing a human brain library of cDNA phage with an overall titer 
of 3.6 X 10^^ pfu/mL The library was stored in aliquots (100 ml) at -78°C as 8% 
glycerol stocks. 

Example 1 
Retrieval of cDNA Encodmg FKBP12 

A. To one liter of 2 x YT culture was added BLT5403 cells (fi:om a 1 ml 
saturated culture) and incubated at 37^C until cell growth reached log phase (OD 6oo = 
0.5-0.8). The culture was then inoculated with 10 ml (9 x 10* phage) of the human 
brain cDNA library stock described above. The number of phage used was 200-fold 
over &e original library diversity in order to maintain an appropriate representation of 
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all library members. It is estimated that fliere are 2.0 x 10^ cells/ml in a culture at OD 
600 = 0.5. The level of infectivity (LOI) is kept above 1000 (LOI = # cells/# of phage) 
in order to ensure complete infection. This is important since a liquid culture 
amplification was performed. The culture was incubated at 37''C until completely 
5 lysed. To achieve high-salt conditions of the phage supernatant, 1 00 ml of 5 M NaCl 

was added and the suspension centrifuged at 4700 rpm for 20 minutes at 4°C in order 
to precipitate the cell debris. This supernatant solution had a final library titer of 1 .8 x 
lO'^ pfii/ml and was used directly in the affinity selections described below. By 
inoculating 1 liter of culture with about 10^ phage, a total of lO" phagemids arc 

10 formed. As the cDNA library contains 1 0^ distinct transformants, each clone is 

represented by approximately 10^ phage particles. 

B, A bio-affinity column was prepared based on monomeric avidin boimd 
to agarose, which has a relatively low affinity for biotin (Kd approximately 10"^ mol). 
This peimits selective release of any biotin-bound species. The elution is thus 

1 5 independent of the FK506-protein interaction because the elution depends on 

competition of biotin for the entire complex containing biotinylated FK506 and 
phage. 

Three columns were prepared. One is the monomeric avidin column per se, 
the second, a biotin pre-treated column, and the third column treated with FK506 
20 biotin. 

In more detail: Three one ml aliquots of monomeric avidin resin 
(hnmunoPure Immobilized Monomeric Avidin Gel, Pierce Chemical Company, 
Rockford, IL) were incubated with either (a) phage wash buffer (P WB) which is 1 x 
phosphate buffered saline, 450 mM NaCl, 0.05% Tween-20; or (b) 5 mM biotin in 

25 PWB, or (c) FK506-biotin (4) with slow rotation for 30 minutes at 4°C, All three 

columns were then rinsed with PWB (30 ml). 

In first round selection, 100 ml which contained about 10^^ phage were poured 
over each column. Thus, each member of the cDNA library is represented by 
approximately 10^ phage. The cDNA phage supernatant (100 ml) was poured over 

30 each column, followed by washing with PWB (100 ml) and distilled water (10 ml). 
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The elution step for each column was identical and involved treatment with 10 ml of 5 
mM biotin and collecting the eluate in 0.5 ml ahquots. A small amount (10 ml) from 
each fraction was used for titer determination. 

The initial fractions from the avidin column contained a phage titer of 
approximately 40 x 10"^ pfu as compared to only 5 x 10"* pfii in the initial fractions of 
the biotin pretreated column. The titers rapidly dechned in subsequent fractions of the 
avidin column. On the other hand, the FK506 biotin treated column showed an 
elevated titer relative to the biotin control column in later fractions. 

The fractions from the FK506 selection were combined and used to inoculate 
100 ml of log phase BLT5403 cells and incubated at 37°C until completely lysed. 
After lysis, 10 ml of 5 M NaCI was added and the solution centrifiiged at 4700 rpm 
for 20 minutes at 4°C to precipitate the cell debris. The supernatant contains the 
amplified phage from the first selection, however, it still has approximately 0.5 mM 
biotin from the previous elution. The biotin was removed through precipitation of the 
phage by adding 80 ml phage precipitant (33% PEG-8000, 3,3 M NaCl) and 
centrifiiging at 7000 rpm for 10 minutes at 4*^0. The precipitated phage were 
resuspended in 30 ml PWB, and a small amount (5 ml) was stored at -78T as an 8% 
glycerol stock. An FBC506"biotin afi&nity column was prepared as described above 
and the remaining 25 ml of tiie amplified phage were poured over the colunm. The 
column was washed with PWB (100 ml) and distilled water (10 ml), then elated with 
5 mM biotin (1 0 ml). The eluate for both the first and second selection rounds were 
plated onto 2 x YT top agar and single plaques were selected for DNA analysis. 

C. Individual plaques, twenty prior to selection and five from each round 
of selection, were stabbed and used to inoculate log phase cultures of BLT5403 cells 
(10 ml) and allowed to incubate at 37°C until the culture completely lysed. Genomic 
DNA was isolated from the phage supematants using standard lambda DNA 
purification resin (Wizard Lambda Preps DNA Purification System, Promega Corp., 
Madison, Wl). Two primers, T7-up (5'-GGAGCTGtCGTATTCCAGTC-3') and T7- 
down (5'-AACCCCTCAAGACCCGTTTA-3') anneal to tiie vector sequence 
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immediately flanking the cloning site and were used with standard PCR conditions to 
evaluate the relative size qf the insert region of the isolated clones. 

As all of the clones from the second round of selection appeared to have the 
same insert size, they were subjected to DNA sequencing and were subsequently 
found to be identical clones. Sequencing reactions were performed with the T7-up 
primer using fluorescently labeled dideoxynucleotides and Taq FS DNA polymerase 
in a thermal cycling protocol, and were analyzed on an Applied Biosystems 377 DNA 
Sequencer. 

The amplified sequences contain 1 18 base pairs more than the gene insert, but 
the complete sequence of the gene insert was obtained. All of the clones firom the 
second selection included the gene encoding the known FK506 binding protem, 
FKBP12 in its expressible form. The 5' and 3' untranslated regions and the start and 
stop codons were contained in the insert between the EcoRI and Hindlll cloning sites. 
The clone obtained had the highest affinity (BQ = 0.4 nM) protein in the FBCBP family 
of homologues. Hixis, the procedure was successful, in only two rounds of selection, 
in isolating clones containing the cognate protein for FKS06. 

The assays performed above on the cDNA library cloned into T7 phage and 
after the first round of selection showed progressive homogeneity. There were no 
redundant clones in the original library and no duplicate sequences found after tfie 
first round of selection. One of the clones identified in the first round was that of the 
gene encoding FKBP12 which, as stated above, was the only clone recovered in the 
second round, out of those evaluated. 

Preparation C 
Synthesis of a bcnzvl cvcloadduct 

The synthesis of the small molecule bait which is a benzyl cycloadduct is 
outlined in Figure 4B. The diazoimide (3) was prepared by the procedure of Savinov, 
S.N.,eM/., Chem, Commun (Cambridge) (1999) 1813-1814. Treatment of 
diazoimide (3) with rhodium(II) perfluorobutyramidate in methylene chloride at room 
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temperature in the presence of benzyl vinyl ether yielded (4) in good yield (88%) and 
high diastereoselectivitiy (>98%). 

The benzyl cycloadduct was then coupled to an agarose support which had 
been activated with an N-hydroxy succinimide and linked with amilaocaproic acid as 
5 outlined m Figure 5. Treatment of (4) witii hydrazine yielded hydrazide (5). The N- 

hydroxy succinimide (NHS) activated aminocaproic acid Imked agarose support 
(3.3 mg, 0.2 mM) was treated with hydrazide (5) (20 ml, 1 00 mM, 50% 
DMSO/water) to provide benzyl scaffold agarose (6), 

Example 2 

10 Selection of Cognate Protein Binding Benzyl Cycloadduct 

The human brain cDI^A library described in Example 1 was used to inoculate 
log phase BLT5615 cells (Novagen, Inc.), containing 1 mM EPTG. The culture was 
incubated at 37T until lysis was observed. The lysate was immediately treated with a 
protease cocktail (1% v/v 8.6 mg Sigma P8465 in 200 ml 20% DMSO/water) and 
15 adjusted to 10 mM DTT, 1% BSA, and 0.5 M NaCL The solution was centrifuged at 

14000 ipm for 15 minutes at 4''C to remove cell debris. 

A 1 ml aliquot of the freshly prepared phage lysate was allowed to batch 
incubate with the probe support (6) (10 ml) for 30 minutes. The support was colunm 
washed with 1 ml of phage wash buffer (P WB) (0.05% Tween-20 in PBS) followed 
20 by batch incubation with 500 ml PWB for 5 minutes. The bound phage were batch 

elutcd by treating the support with 100 rnL of a 10 mM solution of amide (2) in Luna 
Broth contaming 1% BSA for 5 minutes. 

Six clones from the first round were randomly selected and analyzed by 
polymerase chain reaction (PCR) amplification and subsequent DNA sequencing. 
25 None of the first round clones contained an open reading frame (ORF). 

The overall first round phage titer for the specific elution with scaffold 2 was 
6x1 0^ pfu, (plaque forming imits) while the titer of the last wash was 7x1 0^ pfu. The 
lack of significant titer increase was due to the wash step using a small amount of 
detergent (0.01% Tween-20), which contributes to a higher titer in the wash solution. 
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A second round of affinity selection was performed to allow the amplification 
of any binding proteins of lower affmity that might require more than one round of 
selection m order to be detected. The second round specific elation titer increased to 
33x10^ pfii, while the last wash titer remained low (6x10^ pfii), indicating a potential 
phage enrichment 

Of four clones analyzed, one clone was found to contain the entire coding 
region of the a domain of Fi ATP synthase, an ATP binding protein. Altiiough the 
amount of background binding observed in the fibrst round was high, the second 
selection round increased the signal, as exemplified by the presence of the F| ATP 
synthase gene. 

A confirmatory experiment was performed to validate the ability of the a 
domain of Fi ATP synthase to bind the benzyl cycloadduct In a manner similar to 
that set forth above, two phage systems were mixed in a 1:1 ratio - a phage containing 
no insert and expressing no surface protein and a phage expressing the Fi a ATP 
synthase at its surface. The mixture was subjected to one round of selection as 
described above and assayed, resulting in an approximate two-fold increase in the Fi 
a ATP phage as compared to control. 
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Claims 

1 . A method to isolate a protein or its encoding nucleotide sequence 
wherein said protein bmds specifically to a non-peptide bait which method comprises 

contacting a phage display library comprising a multiplicity of different DNA 
fragments encoding candidate proteins for said binding with said non-peptide bait, 
wherein said non-peptide bait is coupled to a solid support under conditions wherein 
said non-peptide bait interacts with a displayed protein to which it binds specifically; 
and 

optionally washing said solid support to remove non-specifically bound phage. 

2. The method of claun 1 which fiirther includes eluting said specifically 
bound phage firom the solid support. 

3. The method of claun 1 wherein said phage display library is 
constructed from a lytic bacteriophage vector. 

4. The method of claun 3 wherein the lytic bacteriophage vector is T7. 

5. The method of claim 1 wherein said non-peptide bait is non-coy alently 
bound to the solid support. 

6. The method of claim 5 wherein said non-covalent binding is effected 
by biotinylated bait coupled to monomeric avidin, wherein said monomeric avidin is 
covalently bound to the solid support. 

7. The method of claim 1 wherein said multiplicity of DNA fragments 
comprises a cDNA library constructed fix)m a specialized tissue or cellular source. 

8. The method of claim 7 ^lierein said specialized tissue is brain. 
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9. The method of claim 1 wherein said multipHcity of DNA fragments is 
less than 10^ different fragments. 

10. A method to identify a nucleotide sequence encoding a protein that 
binds a non-peptide bait which method comprises amplifying and sequencing a 
nucleotide sequence contained in a bacteriophage displaying a protein which binds to 
a non-peptide bait, wherein said phage is identified by the method of claim 1. 

11. A method to produce a protein ^^ch binds specifically to a non- 
peptide bait which method comprises providing an expression system for the 
nucleotide sequence identified by the method of claim 10 to a recombinant system for 
expression of said nucleotide sequence and 

eflFecting the expression of said nucleotide sequence to produce said protein. 

12. A method to identify a compound which binds to a protein receptor, 
which method comprises contacting the protein produced by the method of claim 1 1 
with a candidate compound; assessing the ability of said protein to interact with or 
bind said candidate compound; and 

identifying as a successfiil candidate a compound which interacts with or 
binds said protein. 

13. A successfiil candidate compound identified by the method of 
claim 12. 

14. A method to treat a condition in a subject modulated by a receptor 
protein which method comprises administering to said subject the compound of 
claim 13. 

15. A method to separate a compound which specifically interacts 
noncovalently with a cognate from a mixture of compounds which do not interact 
specifically with said cognate which method comprises providing a solid support to 
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which said cognate is noncovalently coupled through an interaction between a first 
substance covalently linked to said cognate and a second substance which 
noncovalently binds the first substance; 

applying said mixture of compounds to said solid support whereby a 
compound which interacts specifically with said cognate is retained by said solid 
support; 

optionally washing said solid support to remove any nonspecifically coupled 
compounds in the mixture; and 

treating said solid support with a solution containing a sufficient concentration 
of said first substance to elute firom said support said compound associated with its 
cognate, wherein said cognate is a non-peptide small molecule and said compounds 
are phage displayed proteins. 

16. The method of claim 15 wherein said fu^st substance is biotin and said 
second substance is avidin. 

1 7. The method of claim 1 5 wherein said first substance is polyhistidine 
and said second substance is a nickel chelate. 

18. A method to isolate a protein or its encoding nucleotide sequence 
wherein said protein binds specifically to anon-peptide bait which method comprises 
eluting a phage which displays said protein firom a solid support wherein said solid 
support is coupled to said non-peptide bait 

1 9. The method of claim 1 8 wherein said non-peptide bait is noncovalently 
boimd to the solid support. 

20. The method of claim 1 9 wherein said bait is covalently linked to a first 
substance which binds noncovalently to a second substance covalently linked to said 
solid support. 
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21 . The metibod of ctaim 20 wherein the first substance is biotin and the 
second substance is avidin. 
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